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Abstract 

o 

CNJ ■ Let K, be a simplicial complex and g the rank of its p-th homology group H p (/C) defined with Z2 coef- 

ficients. We show that we can compute a basis H of H p (/C) and annotate each p-simplex of K, with a 
C$ " binary vector of length g with the following property: the annotations, summed over all p-simplices in 

any p-cycle z, provide the coordinate vector of the homology class [z] in the basis H. The basis and the 
annotations for all simplices can be computed in 0(n u ) time, where n is the size of JC and u < 2.376 is 
a quantity so that two n x n matrices can be multiplied in 0(n u ) time. The pre-computation of annota- 
tions permits answering queries about the independence or the triviality of p-cycles efficiently. 

rn ■ Using annotations of edges in 2-complexes, we derive better algorithms for computing optimal basis and 

f) , optimal homologous cycles in 1-dimensional homology. Specifically, for computing an optimal basis of 

Hi(/C), we improve the time complexity known for the problem from 0(n 4 ) to 0{n" + n 2 g^ _1 ). Here 
q , n denotes the size of the 2-skeleton of /C and g the rank of H 1 (/C) . Computing an optimal cycle homolo- 

gous to a given 1-cycle is NP-hard even for surfaces and an algorithm taking 2°^ 9 'n log n time is known 
for surfaces. We extend this algorithm to work with arbitrary 2-complexes in 0{n u ) + 2°^n 2 logn 
£>. ■ time using annotations. 
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!>■ ■ 1 Introduction 

en; 

Cycles play a fundamental role in summarizing the topological information about the underlying space 
that a simplicial complex represents. Homology groups are well known algebraic structures that capture 
topology of a space by identifying equivalence classes of cycles. Consequently, questions about homological 
characterizations of input cycles often come up in computations dealing with topology. For example, to 
compute a shortest basis of a homology group with a greedy approach, one has to test several times whether 
the cycles in a given set are independent. To determine the topological complexity of a given cycle, a 
first level test could be deciding if it is null homologous, or equivalently if it is a boundary. Recently, 
a number of studies have been done that concern with the computation of such topological properties of 

cycles (2 a H El M M E2. 

Two optimization questions about cycles have caught the attention of researchers because of their rele- 
vance in applications: (i) compute an optimal homology basis, which asks to compute a set of cycles that 
form a basis of the corresponding homology group and whose weight is minimum among all such basis; 
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(ii) compute an optimal homologous cycle, which asks to compute a cycle with minimum weight in the 
homology class of a given cycle. Chen and Freedman Q have shown that both problems for p-dimensional 
cycles, p-cycles in short, with p > 1, are NP-hard to approximate within constant factor. Thus, it is not 
surprising that most of the studies have focused on 1-cycles except for a special case considered in lfl2l . In 
this paper, we use simplex annotations which lead to better solutions to these problems. We only consider 
homology over the field Z2. 

Annotation. An annotation for a p-simplex is a length g binary vector, where g is the rank of the p- 
dimensional homology group. These annotations, when summed up for simplices in a given cycle z, provide 
the coordinate vector of the homology class of z in a pre-determined homology basis. Such coordinates are 
only of length g and thus help us determine efficiently the topological characterization of z. We provide an 
algorithm to compute such annotations in 0(n u ) time, where n is the number of input simplices and uj is 
a quantity so that two n x n matrices can be multiplied in 0(n w ) time. It is known that uj is smaller than 
2.376 0. 

The high-level idea for computing the annotation can be described as follows. We first compute an 
appropriate basis Z of the cycle space Z P (JC) with the following property: each cycle zE2 has a sentinel, 
which is a simplex a z that appears in the cycle z and in no other cycle from Z. We can then express any 
cycle zq efficiently in the basis Z as the addition of the cycles z from Z whose sentinels a z are contained in 
zq. Next, we compute an arbitrary homology basis H of H p (/C). The annotation of any non-sentinel simplex 
is simply 0, while the annotation of a sentinel a z is the coordinates of the homology class [z] in the basis 
H . Because of linearity, the sum of the annotations over the sentinels in a cycle gives the homology class 
of that cycle. We show how matrix decomposition algorithms can be leveraged to compute these bases and 
annotations efficiently. The recent works of Milosavljevic et al. EUl and Chen and Kerber [HI also employ 
fast matrix operations in Computational Topology. 

Annotating the simplices of a simplicial complex with elements of an algebraic object has a long tradi- 
tion in Algebraic Topology. To work with homotopies in a simplicial complex one can find a non-Abelian 
group called fundamental group, described by certain relations, and assign to each 1 -simplex an element of 
the group such that deciding if a path is contractible amounts to testing whether the product of the corre- 
sponding group elements is the identity. This line of work dates back to Poincare, but testing contractibility 
is equivalent to the word problem for groups, and thus undecidable. When the simplicial complex is a 2- 
manifold, Dehn showed in 1912 that the approach leads to efficient computation which was further studied 
in ifTTI . We refer to Stillwell ||23l Chapters 0, 3, 4 and 6] for a comprehensive treatment and historical 
account of annotations to work with homotopies. Annotations have been used extensively to work with 
1 -dimensional homology in surfaces, where they can actually be computed in linear time; see for exam- 
ple J3l[l5j[l6j|22l. However, we are not aware of previous works using annotations to work with homology 
in higher dimensions or in general simplicial complexes. For readers familiar with cohomology, it may be 
worth pointing out that cocycles {4>i}i=i... g whose classes generate the cohomology group provide an an- 
notation by assigning the binary vector (0i(<r), . . . , 4> g (cr)) to simplex a. From this viewpoint, annotations 
can be seen as exposing the classical relation between homology and cohomology groups. 

Applications. Our annotation technique has the following applications. 

1. Using the annotations for edges, we can compute an optimal basis for 1-dimensional homology group 
Hi(/C) in 0(n u + n 2 g w_1 ) time, where g is the first Betti number of a simplicial complex K,. This 
improves the previous 0(n 4 ) best known algorithm for computing an optimal homology basis in 
simplicial complexes lfT3l . 

2. Since it is known that computing an optimal homologous cycle is NP-hard even for 1-cycles EIHIH 



in surfaces, Chambers et al. |4) designed an algorithm taking near-linear time when g is constant. Er- 
ickson and Nayyeri fljfj improved the running time to 2°^n log n, and Italiano et al. |[T8l provide an 
algorithm using g°^ 9 'n loglog n time. Using our annotations together with the approach of Erickson 
and Nayyeri we obtain an algorithm for finding an optimal homologous cycle in simplicial complexes 

in 0(n w ) + 2°^n 2 log n time. 

3. Using annotations for p-simplices, we can determine if a given p-cycle is null homologous or if two 
p-cycles are homologous in time 0(tg) time where t is the number of p-simplices in the given p- 
cycles. Given a set of p-cycles, we can also answer queries about their homology independence. A set 
of p-cycles is called homology independent if they represent a set of linearly independent homology 
classes. A maximal subset of homology independent cycles from a given set of k cycles with t 
simplices can be computed in 0(tg + (g + k)g u ) time after computing the annotations. 

In many applications, g, the dimension of the concerned homology group is small and can be taken as a 
constant. In such cases, the applications listed above benefit considerably, e.g., applications in 1 and 2 run 
in 0(n u ) time. 

Structure of the paper. We introduce appropriate background in Section 12 and describe the matrix oper- 
ations we use in Section|3] In Section|4]we explain how the annotations for edges can be computed to work 
with 1-cycles. In Section [5] we show results on computing an optimal homology basis and an optimal ho- 
mologous cycle using edge-annotations. In Section [6] we explain how to extend the annotation algorithm for 
edges to compute annotations for p-simplices. We next describe some applications of this general annotation 
result in Section [7] We conclude in Section [8] 

2 Background 

Homology. In this paper, we focus on simplicial homology over the field Z2; see comments in the con- 
clusion section for extension to other finite fields. We briefly introduce the notations for chains, cycles, 
boundaries, and homology groups of a simplicial complex, adapted to Z2. The details and general case 
appear in any standard book on algebraic topology such as ll2"ll . 

Let K, be a simplicial complex. Henceforth, we assume that K is connected and use K p to denote the 
set of simplices in K of dimension at most p. To work with the 1 -skeleton we use V = K.$, E = K,\, and 
borrow standard notation from graph theory. 

A p-chain in K is a formal sum of p-simplices, c = J2aeK a o-°~> &o £ <^2- The set of p-chains forms 
a vector space C p (/C) under ^-addition where the empty chain plays the role of identity 0. The chain 
group C p is in one-to-one correspondence to the family of subsets of K p . Hence C p is isomoiphic to the 
n p -dimensional binary vector space (Z2) np , where n p is the number of p-simplices in K.. A natural basis of 
C p consists of the p-simplices in /C. In this basis, the coordinate vector of a p-chain is the incidence vector 
telling which p-simplices appear in the corresponding subset. 

The boundary of a p-simplex is a (p — l)-chain consisting of the set of its (p — l)-faces. This can be 
linearly extended to a boundary map d p : C p — > C p _i, where the boundary of a chain is defined as the sum 
of the boundaries of its elements. Using the natural bases of C p and C p _i, computing the boundary of a 
p-chain corresponds to multiplying the chain vector with a boundary matrix [61 62 • • • b np ] whose column 
vectors are boundaries of p-simplices. We slightly abuse the notation and denote the boundary matrix also 
with d p . 

We define the group ofp-cycles as the kernel of d p , Z p := ker d p , and define the group of p-boundaries 
as the image of d p+ \, B p := im<9p_|_i. The latter is a subgroup of the former. The p-th homology group H p is 
the quotient Z p /B p . Each element in H p , called a homology class, is an equivalence class ofp-cycles whose 



differences are p-boundaries. For any p-cycle z, we use [z] to denote the corresponding homology class. 
Two cycles are homologous when they belong to the same homology class. Note that Z p , B p , and H p are 
also vector spaces. We call their bases a cycle basis, a boundary basis, and a homology basis respectively. 
The dimension of the p-th homology group is called the p-th Betti number. We will denote it by g. A set 
of p-cycles {z\, ■ ■ ■ , z g } is a homology cycle basis if the set of classes {[z±], ■ ■ ■ , [z g ]} forms a homology 
basis. 

Optimization problems. Given a simplicial complex /C, exponentially many cycles may belong to a ho- 
mology class [z\. We consider an optimization problem over such a set with all p-cycles assigned well- 
defined weights. Given a non-negative real weight w(a) for each p-simplex a, we define the weight of a 
cycle as the sum of the weights of its simplices, w(z) = X^ CT gz w {°~)- F° r example, when p = 1 and the 
weights are the lengths of the edges, the weight of a cycle is its length and the optimization problem seeks 
for the shortest cycle in a given class. Formally, we state: 

Problem 2.1. Given a simplicial complex and a cycle z, find the cycle argmin 2 £ u w(zq). 

Next, we consider an optimization problem over the set of all homology cycle bases. The weight of a 
homology cycle basis H is defined as the sum of the weights of its elements, w(H) = YlzeH w i z )- Note 
that a simplex may contribute to the weight multiple times if it belongs to multiple cycles in the basis H. 
Formally, we have the following problem. 

Problem 2.2. Given a simplicial complex, find a homology cycle basis H with minimal w{H). 

3 Efficient Matrix Operations 

Under Z2 coefficients, the groups C p , Z p , B p , and H p are all vector spaces. Linear maps among such 
spaces or change of bases within the same space can be represented by matrices and operations on them. 
Our algorithm computes simplex annotations via manipulations of such matrices and bases. Several of our 
computations use the following concept. 

Definition 3.1 (Earliest Basis). Given a matrix A with rank r, the set of columns B op t = {0^, • • • , a,i r } is 
called the earliest basis if the column indices {ii, ■ ■ ■ ,i r } are the lexicographically smallest index set such 
that the corresponding columns of A have full rank. 

For convenience, we often use the same symbol to denote both a set of vectors and the matrix they form 
and denote by B opt also the matrix [a^ a% 2 - ■ ■ a%^\. It is convenient to consider the following alternative 
view of the earliest basis: a column vector of A is in the earliest basis if and only if it does not belong to the 
subspace generated by column vectors to its left. 

We next summarize the operations on matrices that we need. For simplicity, we assume that the matrix 
multiplication exponent uj > 2; otherwise, some additional logarithmic terms appear in the running times. 

Proposition 3.2. Let A be anm x n matrix of rank r with entries over Z2 where m < n. 

(a) If A is square and has full rank, one can compute its inverse A -1 in 0{n u ) time. 

(b) There is an 0(71^) time algorithm to compute the earliest basis B op t of A. 

(c) In 0(71^) time, one can compute the coordinates of all columns of A in the earliest basis B op t. For- 
mally, one can compute AP = B op t[I r \ R], where P is a permutation matrix, I r is anr x r identity 
matrix, and R is anr x (n — r) matrix. 



Proof. Item (a) appears in Bunch and Hopcroft [1|; alternatively, see iflOl . Item (b) and (c) follows from 
the LSP -decomposition lfT7l[T9l , which can be computed in 0(11^) time. We restate the following result 
for a column version, where we transpose both sides of the standard LSP-decomposition and rename the 
matrices. 

LSP-decomposition H19II . Given an m x n matrix A, one can compute in 0(n u ) time a decomposition 
A = QSU, where Q is an m x m permutation matrix, U is an n x n upper unitriangular matrix, S is an 
mxn matrix with r = rank(j4) non-zero columns which are linearly independent. Furthermore, S is lower 
unitriangular if we remove all zero columns and the lowest m — r rows. 

Notice that the permutation matrix Q only permutes rows and thus does not affect the computation of 
the earliest basis of the column rank. By definition, the indices of non-zero columns of S (called the column 
rank profile) are the indices of the earliest basis, and thus can be computed by applying LSP-decomposition 
once. Therefore item (b) follows. 

Next, we prove item (c). Due to item (b), we know the indices of the earliest basis B opt in A and we 
can compute a column permutation matrix P so that AP has the first r columns as this earliest basis. Next, 
we compute the LSP-decomposition of the matrix AP = QSU. Since indices of the non-zero columns in 
S correspond to those of the earliest basis of AP, the last n — r columns of S are necessarily zero. Hence 
we can rewrite the LSP decomposition as 



AP = QSU = Q[5i I 0] 



U n U 12 
U 22 



where Si has size m x r, Un has size r x r and is upper unitriangular, U12 has size r x (n — r), and U22 
has size (n — r) x (n — r). Evaluating the right side, we have 

AP = QSiUnllr I U^Uu]. 

Since the first r columns of the matrix on right hand side equal to QS\U\i, and B opt consists of the first 
r columns of AP by definition of P, we see that QSiUn = B opt . Setting R := U^Ui2 we obtain the 
desired decomposition AP = B opt [I r \ R\- The algorithm involves one computation of an earliest basis, 
one LSP-decomposition, and constant number of matrix inversions and multiplications. Since each of these 
operations takes 0(71^) time, item (c) follows. □ 

4 Annotating Edges 

Let K, be a given simplicial complex. First, we define annotations in general terms using g for the dimension 
of H p (/C). 

Definition 4.1 (Annotations). An annotation for p-simplices is a function a: K p — >■ (Z2) 9 with the following 
property: any two p-cycles z and z' belong to the same homology class if and only 

^a(o-) = ^a(a). 

o-ez aez' 

Given an annotation a, the annotation of any p-cycle z is defined by &{z) = ^2 aez a(cr). 

We will construct annotations using coordinate vectors of cycles in a homology basis. Let H = 
(hi, h-2, ■ ■ ■ , h g ) be a basis of the vector space H p (/C). For a p-cycle z, if [z] = Ylt=i ^i^i where each 
Xi E Z2, then the coordinate vector of [z] in H is (Ai, . . . , \ g ) £ (Z2) 9 . The question is how to annotate the 
p-simplices so that the sum of annotations in the simplices of z gives (Ai, . . . , X g ). 



In this section, we explain the technique for annotating edges. An extension to p-simplices is explained 
in Section [6] We compute edge annotations in three steps. First, we construct a cycle basis Z in which any 
cycle can be expressed in simple and efficient terms. Second, we find a homology cycle basis H. Last, 
we compute the homology of each cycle in Z in the homology cycle basis H. From this information, one 
can compute the homology class of any other cycle using vector sums in the coordinate system provided by 
H. The approach is based on using a spanning tree of the 1 -skeleton to generate the space of cycles. The 
approach of using a spanning tree to generate the fundamental group and then obtain the homology group, 
is well known in topology and has been used extensively; see for example lfl4l [161 l23l 

Step 1: Computing a cycle basis Z. Let us fix throughout this section a spanning tree T in the 1-skeleton 
of /C; it contains no — 1 edges. Let k = m — no + 1 be the number of edges in E \ E(T). We fix 
an enumeration ei, . . . , e ni of the edges of E with the property that the edges ei, . . . , e& are precisely the 
edges of E \ E(T). Thus, efc+i, . . . , e ni are the edges of T. The edges of E \ E{T) are called sentinel 
edges, while the edges of E(T) are non-sentinel edges. 

For any sentinel edge e G E \ E(T), denote by j(T, e) the cycle corresponding to the unique simple 
path that connects the endpoints of e in T plus the edge e. We call it a sentinel cycle. Let Z be the set of 
such sentinel cycles {^y(T, ei),j(T, e<i), . . . , j(T, e^)}. We have the following property: a sentinel edge e$ 
belongs to a sentinel cycle j(T, ej) if and only if i = j. For completeness, we set j(T, e) = when e 
belongs to T. The following result is probably folklore. 

Proposition 4.2 (Cycle basis). Z is a cycle basis and for any cycle z € Zi we have z = X^egz 7(^> e )- 

Proof. Since the edge e\ € E\E(T) does not appear in j(T, e^), ■ ■ ■ , j(T, e^), the cycle j(T, e\) is linearly 
independent of j(T, e^), ■ ■ ■ , 7(T, e^). The same argument applies to any cycle j(T, ei) € Z, and thus the 
cycles of Z are linearly independent. Since k = dim(Zi), Z is indeed a basis for Zi. 

To show the second half of the claim, fix an arbitrary cycle z. Fix a vertex s of the tree T and for any 
vertex u of T, let T[s, u] denote the unique simple path in T connecting u to s. We then have: 



z = E e = E (T[s,u]+e + T[s,v]) = J>(T,e). 

e=uv£z e=uvdz egz 

The second equality holds as every vertex in the cycle z is the endpoint of an even number of edges of z. □ 

Step 2: Computing a homology cycle basis H. In this step, we compute a homology cycle basis H from 
Z with the help of Proposition I3.2f b). Specifically, we construct a new matrix [&2 \ Z] with the submatrix 
Z being formed by the chain vectors of cycles in Z. We compute the earliest basis Z = [B \ H] of [82 \ Z] 
where B contains the first r = rank(<92) columns of Z. Since the set of columns of 82 generates the 
boundary group, by the definition of earliest basis, it is necessary that the columns in B come from 82 and 
form a boundary cycle basis. Since Z and hence 82 U Z generates the cycle group, the remaining columns 
of Z, namely H, form a homology cycle basis. 

Step 3: Computing annotations. Finally, for elements of Z we compute their coordinates in the cycle 
basis Z. For each sentinel cycle z = j(T, e), we compute its coordinate vector in Z by solving the linear 
system Zx = z. The last g entries of x give its coordinates in the basis H. We use this length g vector as 
the annotation of the sentinel edge e. We can compute annotations for all sentinel edges together by solving 
ZX = Z and taking the last g rows of the solution X. For a non-sentinel edge, we simply set its annotation 
to be the zero vector. 



An example of annotation for a 2-complex is shown 
on right. The edges of a spanning tree are shown with 
thicker edges. The edges e\,e2,es, e^ are sentinel edges. 
The cycles given by sentinel edges e2 and e% form the 
homology cycle basis H computed by the algorithm. 
We show the annotations for the sentinel edges; all other 
edges get annotation (0,0). The annotation of (1, 1) for 
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e4 makes it possible to evaluate the cycle 626364 to (0, 0) as it is null-homologous and also evaluate the 
outer boundary to (1,1) as it is homologous to the sum of the two holes. 

Theorem 4.3. The algorithm described above computes an annotation of length dim(Hi (/C)) for the edges 
of a 2-complex K, in 0(11^) time, where n is the size oflC. 

Proof. From Step 3, the annotation of a sentinel edge e is exactly the coordinate vector of the homology 
class [y(T, e)]. It then follows from Proposition 14.21 that, for any cycle z, the coordinate vector of the 
homology class [z] is simply the summation of annotations of all edges in z. For the time complexity, notice 
that Step 1 requires computing a spanning tree and the cycle basis Z, which takes 0(n 2 ) time. Steps 2 and 
3 take 0{n w ) time because of Proposition 13.21 b) and (a) respectively. □ 



5 Optimality for 1 -cycles 

5.1 Shortest homology basis 

In this section we discuss the problem of computing an optimal homology basis for one dimensional ho- 
mology Hi. The optimal homology cycle basis here is the shortest homology basis since we minimize the 
weights / lengths. We present an efficient algorithm that combines the approach of Erickson and Whit- 
tlesey lfl6l and our annotation technique. The approach restricts the search to a well-structured family of 
cycles, represents each cycle in this family with a length-^ binary vector, and then reduces the computation 
to the problem of finding an earliest basis in a matrix of size g x n 2 . 

For each vertex s E V, let T s be the shortest path tree from s with respect to the weight function. Denote 
by Z s the set of sentinel cycles corresponding to this tree T s and II the union of Z s for all s S V, that is, 

11= \JZ S = \J{ 7 (T s ,e)\eeE\E(T s )}. 

sGV s&V 

The following property was noted by Erickson and Whittlesey lfl6l . See also Dey, Sun, and Wang |[T3l for 
an extension. 

Proposition 5.1. If we sort the cycles of II in non-decreasing order of their weights, the earliest basis o/TI 
is a shortest homology basis. 

Theorem 5.2. Let K, be a simplicial complex of size n. We can find a shortest homology basis in time 

0(n^ + ji 2 ^" 1 ) where g = rank(Hi(/C)). 

Proof. By Theorem 14.3 1 we compute annotations for all edges in 0(n^) time. Let a(e) denote such annota- 
tion for any edge e, and a(z) = J2eez a ( e ) f° r an y 1-cycle z. 

Next we compute annotations for all cycles z G II. Instead of computing them one by one, we annotate 
all cycles in Z s at once for each s. Given a fixed s, we first compute T s in 0(n log n) time. We assign a 
g-long label £(x) to each vertex x G V. The label £(x) is the label £(x') of its parent x' plus the annotation 
of the edge xx' , a(xx'). We compute labels for all vertices in 0(ng) time by a breadth-first traversal of 



T s . Afterward, the annotation of any sentinel cycle j(T s ,xy) E Z s is computed in 0(g) time as l(x) + 
£(y) + a(xy). Thus, we can compute the annotations for all cycles in Z s in 0(ng) time given T s and edge 
annotations. To annotate all cycles of II, we repeat the procedure for all source vertices s. Computing 
annotations for all cycles thus takes 0(n 2 g + n 2 logn) time. 

Since annotations of cycles give us the homology classes they belong to, we can use them to find a 
shortest homology basis. We sort cycles in II in non-decreasing order of their weights in 0(n 2 log n) time. 
Let zi, Z2, 23, • • ■ be the resulting ordering. We construct a matrix A whose ith column is the vector &(zi), 
and compute its earliest basis. By Proposition 15. II the cycles defining the earliest basis of A form a shortest 
homology basis. Since there are up to n 2 elements in IT, the matrix A has size gxn 2 , and thus it is inefficient 
to compute its earliest basis using Proposition I3.2l directlv. Instead, we use the following iterative method to 
compute the set J of indices of columns that define the earliest basis. 

We partition A from left to right into submatrices A = \A\ jy^l ■ ■ ■ ] where each submatrix Ai contains 
g columns with the possible exception of the last submatrix which contains at most g columns. Initially, we 
set J to be the empty set. We then iterate over the submatrices A4 by increasing index. At each iteration 
we compute the earliest basis for the matrix [.Aj|.Aj] where Aj is the submatrix whose column indices are 
in J. We then set J to be the indices from the resulting earliest basis, increment i, and proceed to the 
next iteration. At each iteration we need to compute the earliest basis in a matrix with g rows and at most 
\J\ + 9 < 2g columns. There are at most 0(n 2 /g) iterations each taking 0(g u ) time. 

We obtain the claimed time bound by adding up the time to annotate edges, annotate cycles in IT, and 
compute the earliest basis. □ 



5.2 Shortest homologous cycle 

In this section, we show how to compute the shortest cycle in a given one-dimensional homology class. In 
fact, within the same running time, we can compute a shortest cycle in each homology class. The idea is to 
use covering graphs, and it closely resembles the approach of Erickson and Nayyeri lfl5l . We skip most of 
the details because of this similarity. Nevertheless, our main contribution is the use of the annotations from 
Section |4] 



Let G = (V, E) be the 1-skeleton of K. We first V x {(0, 0)} 



compute an annotation a: E — >■ (Z2) 9 , as given by 



Vx{(l,0)} 



Theorem 14.31 A walk in G is a sequence of vertices 
xqX\ . . . xt connected by edges in E. It is closed if 
xq = x t . In this section we keep using the term cycle 
for elements of Z\. Each closed walk in G defines a 
cycle, where only edges appearing an odd number of 
times in the walk are kept. The annotation &(w) of a 
walk w = xoxi . . . xt is defined as the sum of the an- 
notations of its edges Xi-\Xi for i = 1, . . . , t. Notice 
that the annotation a(w) of a closed walk w is the an- 
notation of the cycle defined by w, as annotations in 
edges appearing an even number of times in the walk 
cancel out. 

We construct a covering graph G of the 1-skeleton of /C, defined as follows: 

• V{G) = V x (Z 2 ) 9 . 

• vertex (v, h) € V X (Z2) 3 is adjacent to (v 1 , h') € V x (Z2) 9 if and only if e = vv' is an edge of E 
and h! = h + a(e). The weight of an edge (v, h)(v', h') is the weight of vv'. 




Vx{(0,l)} 



Vx{(l,l)} 



The graph G has no • 2 9 vertices and n\-2 9 edges. The covering graph for the example shown previously 
in section |4]for annotation is depicted above. The second coordinate of a vertex (v, h) G V(G) is used to 
encode the homology of cycles, as we will see. Similar covering graphs are used, for example, in ll3l [T5ll22l . 

Proposition 5.3. For all h £ (Z2) 9 , we can compute a shortest walk Wh in G among all closed walks with 
annotation h in 0(2 9 n 2 (g + log n)) time. 

We say that a cycle is elementary if it is connected and each vertex is adjacent to at most two edges 
of the cycle. Each cycle is the union of edge-disjoint elementary cycles. First, we bound the number 
of elementary cycles in optimal solutions and then use dynamic programming across annotations and the 
number of elementary cycles to obtain the following 

Proposition 5.4. The shortest cycle in any given homology class consists of at most g elementary cycles. 

Theorem 5.5. In O^ + 4 9 g + 2 9 n 2 (g + logra)) = (^{n") + 2°^n 2 logn time we can compute the 
shortest homology cycle for all homology classes in Hi. 

6 Annotating p-simplices 

In this section, we show how to compute annotations for p-simplices. Notice that the only thing we need 
to generalize is the first step: find a set E of p-simplices {sentinel simplices) with cardinality dim (Z p ) and 
a cycle basis Z = {z a }o- e z {sentinel cycles) for the p-cycle group Z p with the property that z a contains 
a' E E if and only if a = a'. With this property, any p-cycle z can then be written as z = ^o-ezns z °- 
Taking z a = for all a E, we have z = J2 a ez z °- With such a basis, we proceed with Step 2 and 3 just 
like in the case for edges to annotate p-simplices. Below, we explain how to compute such a cycle basis Z. 

In the case for annotating edges, we first fix a spanning tree. The boundaries of its edges form a 0- 
dimensional boundary basis. Any of the remaining edges when added to the tree creates a unique 1-cycle 
which is kept associated with this edge as a sentinel cycle. For p-simplices, p > 1, we do not have a spanning 
tree, but Proposition 13 .2f c) provides us an algebraic tool that serves the same puipose. 

Specifically, consider the n p ^\ x n p boundary matrix d p of rank r, where the i-th column in d p corre- 
sponds to the {p — l)-boundary of p-simplex ctj. Using Proposition 13.21 c) we can obtain an n p x n p matrix 
P, an n p -\ x r matrix B opt , and an r x {n p — r) matrix R so that 

8 P P = B opt [I r \ R]. 

Notice that P permutes the p-simplices so that the first r columns of d p P form the earliest basis B opt . By 
reordering the columns of d p , we may assume that P is the identity, giving d p = B opt [I r \ R\- In this 
scenario, the columns of B opt form a basis of the column-space of d p , and contains the first r = rank (d p ) 
columns of d p . Note that the i-th column in [I r \ R] gives the coordinate vector of the boundary cycle for o", 
in the boundary basis Bopt. 

Take the first r p-simplices {o~\, ... ,a r }. Their boundaries are linearly independent. Therefore, no 
subset of them can form a p-cycle. In analogy to Section 01 we use T for this collection of p-simplices and 
call them non-sentinel simplices. The set S = JC P \ T of p-simplices are the sentinel simplices. 

Now consider any sentinel p-simplex, say a r+ i € E. Its boundary is the (r + i)-th column in d p and is 
equal to B opt R[i], where R[i] is the i-th column of R. This means that d p a r+ i = X^=i R\j> i](dp a j) where 
R[j, i] is the j-th entry in the i-th column of R. Hence taking the set of p-simplices aj, j € [1, r], whose 
corresponding entries R[j,i] are 1, plus a r+ i itself, we obtain a p-cycle j(T, a r +i)- We call this p-cycle 
a sentinel cycle. Similar to Section [4j we set j(T, a) = for each non-sentinel simplex a € T. Clearly, 
7(T, o~ r +i) can only contain one simplex from E which is oy+j. We have the desired property: a sentinel 



simplex a € £ belongs to a sentinel cycle 7(T, a') if and only if a = a'. Finally, observe that the columns 
give the set of sentinel cycles Z. The (n p — r) x (n p — r) identity matrix I np -r associates 



of 

each sentine 



Ifiv-r 



cycle 7(T, a) in Z to its sentinel p-simplex a. Similar to Proposition 14.21 we have: 



Proposition 6.1. Z = J7(T, o>+i)i • • • , 7(T, o~ np )} is a cycle basis, and for any p-cycle z we have z = 

£ CT62 7(T,a). 

Proof. Z is linearly independent since each cycle contains a unique sentinel simplex. Since Z has n p — r = 
n p — rank(<9 p ) = dim(Z p ) elements, Z forms a basis for Z p . An arbitrary p-cycle z has a unique coordinate 
in the cycle basis Z. Since each sentinel simplex a G £ belongs to one and only one cycle in Z, the 
corresponding coordinate is one for a cycle j(T, a) if and only if a € z. □ 

Combining this proposition with Step 2 and 3 from Sectional we obtain the following theorem. 

Theorem 6.2. We can annotate the p-simplices in a simplicial complex with n simplices in 0{n u ) time. 

7 Null Homology and Independence 

Our annotation algorithm can be used to address some of the computational problems involving p-cycles. 

Null homology. A p-cycle z in a simplicial complex K is called null homologous if [z] = 0. A cycle is 
null homologous if and only if it has zero coordinates in some and hence any basis of H p (/C). Consider the 
problem: 

Ql: Given a p-cycle z in a simplicial complex /C, decide if z is null homologous. 

A p-cycle z is null homologous if and only if it is linearly dependent to columns of d p+ \. This could be 
determined by checking whether z belongs to the earliest basis of the matrix [d p+ \ \ z\. The complexity of 
such computation is 0(n w ) (Proposition 13.21 b)). 

However, with annotations whose computations take 0{n^) time, we can improve the query time for 
Ql to 0(tg) where g = dim H p (/C) and t is the number of p-simplices in z. For this we simply add the 
annotations of the p-simplices in z and check if the result is zero, which takes 0(tg) time. Now consider 
the following decision problem: 

Q2: Given two p-cycles z\ and Z2 in a simplicial complex /C, decide if z\ and Z2 are homolo- 
gous. 

The problem Q2 reduces to Ql because z\ and Z2 are homologous if and only if z\ + Z2 is null homologous. 
Therefore, Q2 can be answered in 0((t\ + ^2)5) time after 0(n w ) time preprocessing where t\ and £2 are 
the number of p-simplices in z\ and Z2 respectively. 

Independence. An analogous problem to testing null homology is the problem of testing independence. 

Q3: Find a maximally independent subset of a given set of p-dimensional homology classes 
[z\], . . . , [zk] in a simplicial complex JC. 

Without our annotation technique, for each such query, we could apply Proposition 13 .2f b) to the nx (n + k) 
matrix [d p+ \ \ z\ z^- • • z^\. The set of cycles ZjS belonging to the earliest basis would represent a linearly 
independent set of classes. The overall complexity is 0((n + k)^). Using the iterative technique delineated 
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in Theorem l5.2l we can improve the query time to 0(71^(11 + k)/n) = 0(n u + kn^^ 1 ). However, with our 
annotation technique, we could improve the query time. 

Compute the annotations for all edges in 0(11,^) time. By Theorem 16.21 we can then compute an an- 
notation &(zi) in 0(tg) time for all cycles Zi&, where t is the number of simplices altogether in all cycles 
and each &(zi) is a length-^ vector. Now construct a matrix A whose ith column is the vector a(zj). Notice 
that the earliest basis of A form a maximally independent subset of column vectors in A. Thus the set of 
cycles z,s corresponding to columns in this earliest basis form a maximally independent subset of the input 
set of p-cycles. Since A is of size g x fc, we have an 0(g w + k w ) query time, after an 0{n w ) preprocessing 
time. We can improve the query time to 0((g + k)g u ~ l ) by using the iterative technique in Theorem 15.21 
Therefore, total time for computing a maximally independent set takes 0(tg + (g + k)g w ~ x ) time after 
annotations. 

8 Conclusions 

In this paper, we present an algorithm to annotate p-simplices in a complex so that computations about the 
homology groups can be done faster. We have shown its applications to some problems that concern with the 
optimality of 1-cycles and topological characterizations of the p-cycles. The algorithm uses operations such 
as matrix inversion and matrix multiplication, and thus can take advantage of the best known algorithms for 
these operations, which take o(?i 3 ) time. 

For denning the weights of a cycle we used 1-norm to combine the weights of the individual edges. For 
defining the weight of a basis we also used 1-norm to combine the weights of the basis cycles. In these 
problems we can use any other norm to define these weights. 

One may wonder why we cannot extend our annotation approach to compute the optimal homology 
basis or the optimal homologous cycle for higher dimensional cycles. The main bottleneck for finding an 
optimal basis is that the Proposition 15. II does not generalize to higher dimensional cycles. Given that the 
problems are NP-hard in high dimensions even for g = 1 Q, such extensions cannot exist unless P=NP 

Instead of computing the shortest homology basis, one may want to compute a set of edges with minimal 
total weight which supports a homology cycle basis. In such case, the algorithm in Theorem 15.21 cannot be 
used. The annotations and the covering graph might help. 

Finally, we point out that one can use any finite field instead of Z2 for annotations. Since annotations 
mainly utilize matrix multiplications which remain valid under any field, the annotation algorithm in sec- 
tion [6] remains applicable without any change. However, the optimal cycles in sections [5] and 15.21 require 
computations of cycles associated with shortest paths which we do not know how to generalize for general 
fields. Specifically, it is not clear how to obtain results for applying Propositions 15. Il and l5.3l This could be 
a topic of further research. 
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A Omitted proofs 

Proof of Proposition 15.31 We first argue that there is a bijection between closed walks in G through a vertex 
v with annotation h and walks in G from vertex (v, 0) to vertex (v, h). Indeed, assume first that there is a 
closed walk w = £0^1 • • • xt in G with xt = xq = v. By the definition of annotation we have 

a ( w ) = ^2 a ( x i~l x i)- 
i=l,...,t 

Define yo = (^o> 0) and define, for each vertex Xi in w, the vertex yi = (xi, J2j<i a i x j-l x j))- By con ~ 
struction, there is an edge in G between yi and j/i+i, and therefore y$y\ • • • yt is a path in G from (v , 0) 
to (v, &(w)). Conversely, for any walk in G from vertex (v, 0) to vertex (v, h), the projection into the first 
coordinate provides a closed walk through v whose annotation is h. 

For all v G V, we compute the shortest paths from (v, 0) to all vertices in G, and record the lengths. This 
is |V| computations of shortest path trees in G and thus takes 0(n(2 9 nlog(2 9 n)) = 0(2 9 n 2 (g + logn)) 
time (Erickson and Nayyeri |[T5l can speed up this step because in their case G is embedded in a surface.) 
The closed walk Wh is then obtained by considering the shortest path in G between (v, 0) and (v, h), over 
all vertices v G V, and then taking its projection onto G. □ 

Proof of Proposition 15.41 We can assume that the homology class is nonzero, as otherwise the result 
is trivial. Assume for contradiction that the shortest cycle z in a given homology class contains more 
than g elementary cycles z\, ■ ■ ■ , zt, where t > g. Each elementary cycle is a cycle by itself. Since the 
homology classes of these cycles are not linearly independent, there exists a set of indices / C {1, . . . , t} 
such that = X^ig/N]- Notice that / ^ {1, . . . , t} as otherwise [z] = SiGii t\l z i\ = 0- ^ ne c Y c l e 
z' = X^eji t\\i z i i s strictly shorter than z, and represents the same homology class as z. This contradicts 
the assumption that z is shortest in its class. □ 

Proof of Theorem 15.51 We compute annotations for the edges in O^^) time using Theorem 14. 3 1 For 
each h € (Z2) 9 we compute the closed walk Wh given in Proposition 15.31 This takes 0{2 9 n 2 {g + logn)) 
time. 

For any h G (Z^) 9 and any integer k € [1, g], we define C(h, k) as follows 

C (h,k) = h gthoiWh ifk=1; 

\mm{C(hu k - 1) + C{h 2 , l)\h = h 1 + h 2 } if k > 1. 

It is straightforward to see by induction that C(h, k) is an upper bound on the length of the shortest cycle 
with annotation h. Most interestingly, C(h,g) is the length of the shortest cycle with annotation h. Indeed, 
if the shortest cycle with annotation h consists of the elementary cycles z\, . . . ,z t with t < g, then it follows 
by induction that, for any I C {1, . . . t}, C (J2iei a i z i)i l-^l) i s tne length of (J2iei z '>)- 

A standard dynamic programming algorithm to compute C(h, k) takes 0{2 9 ) time per element, for a 
total of 0{4Pg) time. We then return for each h G (Z2) 9 the cycle defining C(h,g). (A shortest cycle 
homologous to a given cycle z is obtained as the cycle defining C(a(z),g).) □ 
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